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Abstract 

Building upon the known generalized-quantifier-based first-order charac- 
terization of LOGCFL, we lay the groundwork for a deeper investigation. 
Specifically, we examine subclasses of LOGCFL arising from varying the arity 
and nesting of groupoidal quantifiers. Our work extends the elaborate theory 
relating monoidal quantifiers to NC 1 and its subclasses. In the absence of the 
BIT predicate, we resolve the main issues: we show in particular that no single 
outermost unary groupoidal quantifier with FO can capture all the context-free 
languages, and we obtain the surprising result that a variant of Greibach's 
"hardest context-free language" is LOGCFL-complete under quantifier-free 
BIT-free projections. We then prove that FO with unary groupoidal quantifiers 
is strictly more expressive with the BIT predicate than without. Considering a 
particular groupoidal quantifier, we prove that first-order logic with majority 
of pairs is strictly more expressive than first-order with majority of individuals. 
As a technical tool of independent interest, we define the notion of an aperiodic 
nondeterministic finite automaton and prove that FO translations are precisely 
the mappings computed by single-valued aperiodic nondeterministic finite 
transducers. 

Keywords: finite model theory, descriptive complexity, computational com- 
plexity, automata and formal languages 



1 Introduction 



In Finite Automata, Formal Logic, and Circuit Complexity [jStr94 ] , Howard Straub 



ing surveys an elegant theory relating finite semigroup theory, first-order logic, and 
computational complexity. The gist of this theory is that questions about the struc- 
ture of the complexity class NC 1 , defined from logarithmic depth bounded fan-in 
Boolean circuits, can be translated back and forth into questions about the express- 
ibility of first-order logic augmented with new predicates and quantifiers. Such a 
translation provides new insights, makes tools from one field available in the other, 
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suggests tractable refinements to the hard open questions in the separate fields, and 
puts the obstacles to further progress in a clear perspective. 

In this way, although, for example, the unresolved strict containment in NC 1 
of the class ACC°, defined from bounded-depth polynomial-size unbounded fan-in 
circuits over {AND, OR, MOD}, remains a barrier since the work of Smolensky 
|Smo87 |, significant progress was made in (1) understanding the power of the BIT 
predicate and the related circuit uniformity issues [ BIS90|1 , (2) describing the regular 
languages within subclasses of NC 1 flBCST92j , |PMT91| 1, and (3) identifying the all- 
important role of the interplay between arbitrary and regular numerical predicates 
in the status of the ACC versus NC 1 questio n ptr94] , p. 169, Conjecture IX.3.4]. 

Barrington, Immerman and Straubing [BIS9C] introduced the notion of a 
monoidal quantifier and noted that, for any non-solvable group G, the class NC 1 
can be described using first-order logic augmented with a monoidal quantifier for 
G. Loosely speaking, such a quantifier provides a constrained "oracle call" to the 
word problem for G (defined essentially as the problem of computing the product of 
a sequence of elements of G). 

Bedard, Lemieux and McKenzie [BLM92] later noted that there is a fixed finite 
groupoid whose word problem is complete for the class LOGCFL of languages re- 
ducible in logarithmic space to a context-free language [ |Coo71 , |Sud7£ ] . A groupoid 
G is a set with a binary operation satisfying no discernible property, and the word 
problem for G is that of computing the set of all legally bracketed products of a 
given sequence of elements of G. It is not hard to see that any context-free lan- 
guage is the word problem of some groupoid, and that any groupoid word problem 
is context-free (see [BLM93, Lemma 3.1]). 

It followed that LOGCFL, a well-studied class which contains nondeterministic 
logarithmic space [ 5ud78| and is presumably much larger than NC 1 , can be described 
by first-order logic augmented with groupoidal quantifiers. These quantifiers can be 
defined formally as Lindstrom quantifiers [ Lin66| 1 for context-free languages. 

In this paper, we take up the groupoidal first-order characterization of LOGCFL, 
and initiate an investigation of LOGCFL from the viewpoint of descriptive complex- 
ity. The rationale for this study, which encompasses the study of NC 1 , is that tools 
from logic might be of use in ultimately elucidating the structure of LOGCFL. We 
do not claim new separations of the major subclasses of LOGCFL here. But we 
make a first step, in effect settling necessary preliminary questions afforded by the 
first-order framework. 

Our precise results concern the relative expressiveness of first-order formulas with 
ordering (written FO), interpreted over finite strings, and with: (1) nested versus 
unnested groupoidal quantifiers, (2) unary versus non-unary groupoidal quantifiers, 
(3) the presence versus the absence of the BIT predicate. Feature (3) was the 
focus of an important part of the work by Barrington, Immerman and Straubing 
[BIS90] on uniformity within NC 1 . Feature (2) was also considered, to a lesser 
extent, by the same authors, who left open the question of whether the "majority- 
of-pairs" quantifier could be simulated by a unary majority quantifier in the absence 
of the BIT predicate [ BIS90| , p. 297]. Feature (1) is akin to comparing many-one 
reducibility with Turing reducibility in traditional complexity theory. 

Here we examine all combinations of features (1), (2) and (3). Our separation 
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results are summarized on Fig. [I] on p. [l8|. In the absence of the BIT predicate, we 
are able to determine the following relationships: 

• FO to which a single unary groupoidal quantifier is applied, written Q^ p FO, 
captures the CFLs, and is strictly less expressive than FO with nested unary 
quantifiers, written FO(QqL), which in its turn is strictly weaker than 
LOGCFL. A consequence of this result, as we will see, is an answer to the 
above mentioned open question from [|BIS9C |: We show that first-order with 
the majority-of-pairs quantifier is strictly more expressive than first-order logic 
with majority of individuals. 

• No single groupoid G captures all the CFLs as Q^FO, i.e. as FO to which 
the single unary groupoidal quantifier Q^f is applied, 

• FO to which a single non-unary groupoidal quantifier is applied, written 
QcrpFO, captures LOGCFL; our proof implies, remarkably, that adding 
a padding symbol to Greibach's hardest context-free language [Gre73], see 
also [ ABB97] ], yields a language which is LOGCFL-complete under BIT- free 
quantifier-free projections. 

When the BIT predicate is present, first-order with non- unary groupoidal quantifiers 
of course still describes LOGCFL. In the setting of monoidal quantifiers [BIS90], 
FO with BIT is known to capture uniform circuit classes, notably uniform ACC°, 
which have not yet been separated from NC 1 . We face a similar situation here: the 
BIT predicate allows capturing classes (for example FObit(Q(>p)' verifying TC° C 
FObit(QQrp) — LOGCFL), which only a major breakthrough would seem to allow 
separating from each other. We are able to attest to the strength of the BIT predicate 
in the setting of unary quantifiers, proving that: 

• (5qJ. p FO C (5Ji,° p FObit, i-e. (trivially) some non-context-free languages are ex- 
pressible using BIT and a single unary groupoidal quantifier, 

• FO(QqL) C FObit(Q(>p), i- e. (more interestingly) BIT adds expressivity even 
when unary groupoidal quantifiers can be nested. 

We also develop a technical tool of independent interest, in the form of an ape- 
riodic (a. k. a. group-free, a. k. a. counter-free) nondeterministic finite automaton. 
Aperiodicity has been studied intensively, most notably in connection with the star- 
free regular languages |Sch65], but, to the best of our knowledge, always in a deter- 
ministic context. Here we define a NFA A to be aperiodic if the DFA resulting from 
applying the subset construction to A is aperiodic. The usefulness of this notion lies 
in the fact, proved here, that first-order translations are precisely those mappings 
which are computable by single- valued aperiodic nondeterministic finite transducers. 

Section 2 in this paper describes our first-order framework and exhibits a link 
between standard formal language operations and unary generalized quantifiers. Sec- 
tion 3 introduces nondeterministic finite transducers and proves that they charac- 
terize first-order translations. Section 4 forms the bulk of the paper and develops 
the relationships between our logic-based LOGCFL subclasses. Section 5 concludes 
with a number of suggestions how to extend the results obtained here. 
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2 Preliminaries 



2.1 Complexity theory 

REG and CFL refer to the regular and to the e-free context-free languages respec- 
tively. The CFL results in this paper could be adapted to treat the empty string e 
in standard ways. We will make scant reference to the inclusion chain 

AC C ACC° C TC° C NC 1 C NL C LOGCFL = SAC 1 C P, 

where we assume familiarity with NC 1 , NL, and P, and recall that 

• AC (resp. ACC°) (resp. TC°) is the set of languages recognized by sufficiently 
uniform families of constant depth, polynomial size, unbounded fan-in circuits 
over the basis {A, V,->} (resp. over a basis consisting of {A, V} together with 
a single Boolean MOD g gate, defined to output iff q divides the sum of its 
input bits) (resp. over the basis consisting solely of -i and the MAJORITY 
gate, defined to output 1 iff at least half of its input bits are set), 

• LOGCFL is the set of languages logspace-reducible to a context-free language 
| Coo71 , Sud7Sf| ; alternatively, this class is SAC 1 , namely the set of languages 



recognized by uniform families of log depth, polynomial size, Boolean circuits 



in which A has bounded fan-in and the fan-in of V is unrestricted [ Ven91|. 



2.2 The first-order framework 

We consider first-order logic with linear order. We restrict our attention to string 
signatures, i.e. signatures of the form (P ai , . . . , P ag ), where all the predicates P ai 
are unary, and in every structure A, A \= P ai (j) iff the jth symbol in the input 
is the letter 04. Such structures are thus words over the alphabet (oi, . . . , a s ), and 
first-order variables range over positions within such a word, i. e. from 1 to the word 
length n. For technical reasons that will become apparent shortly, we assume here, 
as in the rest of the paper, a linear order on each alphabet and we write alphabets 
as sequences of symbols to indicate that order. 

Our basic formulas are built from variables in the usual way, using the Boolean 
connectives {A, V, ->}, the relevant predicates P ai together with {=, <}, the constants 
min and max, the quantifiers {3,V}, and parentheses. We will occasionally use 
the binary predicate BIT(x,y), defined to be true iff the xth bit in the binary 
representation of y is 1. We write BC(£) to denote the Boolean closure of the set £ 
of languages (i.e. closure under intersection, union, and complement) and BC + (£) 
to denote the closure under union and intersection only. 

Definition 2.1. Lindstrom quantifier. Consider a language L over an alphabet 
£ = (ai,d2,..- ,a s ). Let x be a fc-tuple of variables (each of which ranges from 
1 to the "input length" n, as we have seen). In the following, we assume the 
lexical ordering on {1,2,... ,n} k , and we write X\,X2,... ,X n k for the sequence 
of potential values taken on by x. The groupoidal quantifier Ql binding x takes 
a meaning if s — 1 formulas, each having as free variables the variables in x (and 
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possibly others), are available. Let 4>i(x), <f>2(%), . . . , 4> s -\(x) be these s — 1 formulas. 
Then Ql,x~[4>i{x), foix), • • • , <fts-i(x)] holds on a string it; = wi ■ ■ ■ w n , iff the word 
of length n k whose ith letter, 1 < i < n k , is 

ox if io |= <£i(Xj), 

a 2 if w [= ->0i(Xi) A 2 pQ), 

k a s iiw\= -.01 (Xj) A -ifoGXi) A ... A -^ s _i(Xi), 

belongs to L. Thus the formulas [(j>\(x), <f>2(x), . . . , s _i(x)] fix a function mapping 
an input word/structure w of length n to a word of length n k . This function is called 
the reduction or transformation defined by [<p\ (x) , {x) , • • • , 0<j_i(x)]- In case we 
deal with the binary alphabet (s = 2) we omit the braces and write Qlx4>(x) for 
short. 



Definition 2.2. A groupoidal quantifier is a Lindstrom quantifier Ql where L is a 
context-free language. 



The Lindstrom quantifiers of Definition 2.1 are more precisely what has been 



refered to as "Lindstrom quantifiers on string" [BV98]. The original more general 
definition [Lin66] uses transformations to arbitrary structures, not necessarily of 
string signature. However, in the context of this paper reductions to CFLs play a 
role of utmost importance, and hence the above definition seems to be the most 
natural. 

The terminology "groupoidal quantifier" stems from the fact that any context- 



free language is a word problem over some groupoid [BLM93, Lemma 3.1], and 
vice-versa every word problem of a finite groupoid is context-free. Thus a Lind- 
strom quantifier on strings defined by a context-free language is nothing else than 
a Lindstrom quantifier (in the classical sense) defined by a structure that is a finite 
groupoid multiplication table. 



Barrington, Immerman, and Straubing, defining monoidal quantifiers in [BIS90], 
in fact proceed along the same avenue: they first show how monoid word problems 
can be seen as languages, and then define generalized quantifiers given by such 



languages (see HBIS90| , pp. 284f.]). 

We refer the reader to standard texts for formal details on the semantics of 
our logical framework. For instance, Definition 2A skims over the semantics of 
a groupoidal quantifier in the case in which the underlying formulas contain free 
variables other than those in x. We find Straubing's handling of these issues |Str94| 
particularly convenient and we will occasionally refer to his treatment. 



2.3 Groupoid-based language classes 

Here we define our first-order language classes precisely. Fix a finite groupoid G. 
Each SCG defines a language W(S, G) composed of all words w, over the alphabet 
G, which "multiply out" to an element of S when an appropriate legal bracketing 
of w is chosen. 
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Definition 2.3. QqFO is the set of languages describable by applying a single 
groupoidal quantifier Ql to an appropriate tuple of FO formulas, where L = W(S, G) 
for some S C G. 

QcrpFO is the union, over each finite groupoid G, of QgFO. 

FO(<5g) and FO(QGrp) are defined analogously, but allowing groupoidal quantifiers 
to be used as any other quantifier would (i. e. allowing arbitrary nesting). 
Q'qFO and FObit(Q^ p ), etc, are defined analogously, but possibly allowing the 
BIT predicate (signaled by subscripting FO with bit) and/or restricting to unary 
groupoidal quantifiers (signaled by the exponent "un"). 

We use FO(+) to denote that the additional predicate "x + y = z" (with the 
obvious semantics) is additionally allowed. It is known that FO(+) can express 



exactly the semi- linear sets (see [Har78, p. 231]). 



2.4 Unary quantifiers and homomorphisms 

We will encounter unary groupoidal quantifiers repeatedly. Here we show how these 
relate to standard formal language operations. Recall that a length-preserving homo- 
morphism X* — > A* is the unique free monoid morphism extending a map h : S — > A 
for finite alphabets X, A. In a different context, a result very similar to the next 



theorem is known as Nivat's Theorem [MS97, Theorem 3.8, p. 207] 



Theorem 2.4. Let B be an arbitrary language, and let A be describable in Q^FO, 
that is, by a first order sentence preceded by one unary Lindstrom quantifier 
(i. e. binding exactly one variable). Then there are length-preserving homomorphisms 
g, h and a regular language D such that A = h{D n g^ 1 {B)). 

Proof. Let A be defined by the formula ip G Q^FO, ip = Qbx<P{x), B CP (assum- 
ing r = (0, 1) initially). Let A be the underlying alphabet determined by the string 
signature. <f> thus defines a mapping from words over A to binary words. Define D 
to consist of all words [^] • • • [^ fe ] such that cp maps u\ ■ ■ ■ U}. to y\ • ■ ■ y^. Define the 
homomorphisms h and g by h: m i— > a and g: [?1 i— ► b for all a £ A and b G T. 
Then h(D n g^ 1 {B)) = A. But why is D regular? Intuitively, D is regular because 
FO languages are regular. Arguing formally requires a bit of care because each y^ 
depends on the truth value of an FO formula in which the variable x is instantiated 
with i. A proof that a finite automaton is able to determine yi can be found in 
Straubing [ JStr94 , pp. 23-24]. To see that D itself is regular, note that an NFA N 



can guess an incorrect (by guessing the position of the formal variable x in a 
V-structure, borrowing notation from Straubing) and verify that j/j is incorrect. In 
this way accepts the complement of D, so that D is regular^} 

The above strategy to show the regularity of D adapts to the case of a non- 
binary alphabet T, in which case A is a direct product of the NFAs accepting the 
languages defined by the relevant tuple of FO formulas. The homomorphisms g and 
h are unchanged. □ 



1 An alternative proof that D is regular is immediate from Theorem 3.5 
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Remark 2.5. FO precisely captures the variety of star- free regular languages 
[MP71], which allows us to even conclude that the D above is star- free. 



3 An automaton characterization of FO-translations 

As a technical tool, it will be convenient to have an automata-theoretic character- 
ization of first-order translations, i. e. of reductions defined by FO-formulas with 
one free variable. Since FO precisely describes the (regular) languages accepted by 



aperiodic deterministic finite automata |MP71f| , one might expect aperiodic deter- 



ministic finite transducers to capture FO-translations. This is not the case however 
because, e.g. the FO-translation which maps every string w\ ■ ■ ■ w n to cannot be 
computed by such a device. 

We show in this section that the appropriate automaton model to use is that 
of a single-valued aperiodic nondeterministic finite transducer, which we define and 
associate with FO-translations in this section. But first, we discuss the notion of an 
aperiodic NFA. 

Definition 3.1. A deterministic or nondeterministic FA M is aperiodic (or group- 
free) iff there is an n € N such that for all states s and all words w, 

5(s,w n ) = S(s,w n+1 ). 

Here 5 is the extension of M's transition function from symbols to words. Observe 
that if M is nondeterministic then 5{t, v) is a set of states, i. e. locally here we abuse 
notation by not distinguishing between M's extended transition function 5 and the 
function 5* as defined in the context of a nondeterministic transducer below. 



Remark 3.2. This definition of aperiodicity for a DFA is the usual one (see [ Ste85| ). 



For a NFA, a statement obviously equivalent to Definition 3.1 would be that A is 
aperiodic iff applying the subset construction to A yields an aperiodic DFA. Hence 
|Sch65 1 a language L is star- free iff some aperiodic (deterministic or nondeterminis- 



tic) finite automaton accepts L. 

We now prepare the ground for the main result of this section, namely that single- 
valued aperiodic nondeterministic finite transducers characterize FO-translations. 

Definition 3.3. A finite transducer is given by a set Q of states, an input alphabet 
X, an output alphabet F, an initial state qo, a transition relation KQxSxTxQ 
and a set F C Q of final states. For a string w = W\ ■ ■ ■ w n £ X* we define the set 
Om{w) of outputs of M on input w as follows. A string v £ T* of length n is in 
Om(w), if there is a sequence sq = qo, s±, . . . , s n of states, such that s n £ F and, for 
every i, 1 < i < n, we have (sj-i, Wi, V{, Si) € 5. 

We say that M is single-valued if, for every w € £*, |Oa/(^)| = 1- If M is 
single- valued it naturally defines a function fu '■ X* — ► T*. 

For every string u E X* and every state s £ Q we write 5*(s,u) for the set of 
states s' that are reachable from s on input u (i.e., there are s\, . . . ,s\ u \ = s' and 
v\ ■ ■ ■ v\ u \ such that, for every i, 1 < i < \u\, we have U{, V{, Si) € 5). 



As per Definition M is aperiodic if there is an n G N such that for all states 
q and all strings w, 5*(q,w n ) = 5*(q,w n+1 ). 

We will need some basic properties of FO-logic on strings. 

Let A; be a fixed natural number and X an alphabet. For every string u we write 
Q k for the set of FO-sentences of quantifier-depth k that hold in u. Let S k denote 
the set {& k | u G X*}. It is well-known that S k is finite, for every fixed k and X. 

Lemma 3.4. Let u,u',v,v' be strings such that & k = and = T/ien 

^■tre u'v' 

Proof. As <3?^ = and & k = we know that the duplicator has a winning 
strategy in the fc-round Ehrenfeucht game on u and u' and in the game on v and i/. 
These strategies can be easily combined to get a strategy on uv and u'v' . From the 
existence of this winning strategy we can, in turn, conclude that $ k v = & k i v i • □ 

Theorem 3.5. A function f : X* — > T* is defined by an FO translation if and only 
if it is defined by a single-valued aperiodic finite transducer. 

Proof. To simplify notation we assume that T = (0, 1). The proof of the general 
case is a straightforward generalization. 

(only if) Let / : X* — » T* be defined by formula <p(x) of quantifier-depth k (hence, 
for every w G X* and every i < \w\, the i-th bit of f{w) is 1 iff w \= <p(i))- We define a 
single- valued aperiodic finite transducer M with input alphabet X, output alphabet 
T, set S k x S h L){qo} of states, initial state qo and accepting states {($, | $ 6 S k }. 
Informally, a state (^i,^) of M represents a situation, in which M "knows" that 
<I>i contains exactly those formulas (of quantifier depth k) that hold in the prefix of 
the input string that was already read, and it "guesses" that $2 contains exactly 
those formulas that hold in the remaining part of the string. 

The transition relation 5 of M is defined as follows. For every $1, $2> $2 €. S k , 
every a G X and every r G T we let 

(($ 1) * 2 ),a,T,($ / 1 ,$ 2 )) £<5, 

if there exist strings u, v G X* such that $1 = = $^ (T , <3? 2 = ^2 = ®v 

and r = 1 ucr-i; |= <p(\u\ + 1). 

Analogously, for every ,$' 2 £ S 1 ', every a G X and every r G T we define 

( 9o ,0-,r,($i,$ 2 )) G 8, 

if there exists a string v G X* such that = <J> 2 = an d t = 1 °"v \= fiX)- 
We first check that M is single- valued. Let w = wi ■ ■ ■ w n , and /(w) = v\ • • • v n . 
We set s = qo and, for every % > 0, Sj = ($J, r .^,$t i+ i-t« B )' % usin g Lemma gg, 
it is easy to verify that s n E F and, for every z > 0, we have (sj_i, Vj, Sj) G 8. 
Hence f(w) G Oa/(w). 

We have to show now that no string u = u\ ■ ■ ■ u n 7^ f(w) is in Om(w). Assume 
otherwise and let s' = qo, s[, ... , s' n be a sequence of states that outputs it. Let, for 
every i > 0, s[ =: (^j,Gj). First, it is easy to observe that, for every i > 0, ^ = 
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^toi-iiv As u 1S different from v there must be a j such that @j ^ ^w j+1 -w n (Note 
that from the definition of 5 it follows that (s,a, l,s') £ 5 implies (s, cr, 0, s') 6). 
We conclude that for every i > j, 0j ^ i+1 ... Wn - Assume, otherwise that i > j is 
minimal, such that 6, = &w i+1 -w n - By definition of 5 and as (s' i _ 1 , Wi, r, s'j) € 5 it 
follows immediately that 0£_i = $£, iU ^ +1 ... Wn , a contradiction. Hence, in particular, 
G n / i.e., s' n £F. It follows that M is single- valued and }'m = /• It remains 
to show that M is aperiodic. First of all, it is well-known, and can be shown by 
an Ehrenfeucht game argument [ EF95| ] that, for n = 2 k and every w € X* it holds 



k 



Let now $i, $ 2 , $2 € 5 fe and let u, i? G S* with $i = <j?£ and 3>' 2 = $(5. From 
Lemma |3.4j and the definition of 5 we can conclude that ($^,$2) £ <5*(( < E > i, < I ) 2)5 x ) 
if and only if $2 = ^„ and ^ = Hence, again with Lemma |3.4| , we get for 

every uu the following. 



$2 = and $ 'i = $ 

($i,$ 2 ) 6^(($i,$ 2 )^ 



This implies that M is aperiodic. 

(if) Let / be computed by a single-valued aperiodic finite transducer M = 
(Q, X, T, qo, 5, F). It is easy to check that, for every s, s' C Q, the language 

L(s,s) = {u I s G (T(s,u)} 

is accepted by an aperiodic finite automaton. Consequently, every L(s, s') is char- 
acterized by a FO formula ip s ' s . Let <p(x) be the formula 

V ^r(*) A P CT (x) A (/<'*"(*). 

s"eF/\(s',(7,i,s')ei5 

Here, for every s and s', (x) is the formula that is obtained by relativizing ip s,s ' 
to all positions that are smaller than x and c/?> s (x) is the formula that is obtained 
by relativizing (p s,s to all positions that are greater than x (see for example |Str94, 
pp. 81f]). 

Hence, for every position x, (p(x) becomes true in a string uu if and only if there 
are states s, s', s" such that 

• M can reach s from the initial state by reading the string left to x, 

• M can reach s' from s by reading the symbol at position x and output a 1, 
and 

• M can reach the final state s" from s' by reading the string to right to x. 

As M is single- valued, ip(x) defines /m(w), for every w. □ 



4 First-order with groupoidal quantifiers 
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4.1 The largest attainable class: LOGCFL 



Theorem 4.1. There is a fixed groupoid G such that 

Q G FO bit = FO bit (Q Grp ) = LOGCFL. 

Proof. QcFObit C FObit(QGrp) holds by definition for any groupoid G. To see that 
FObit(Qcrp) C LOGCFL, note that [ BIS90 , Theorem 8.1] implies the existence of 



a logspace-uniform AC°-reduction, from any language in FObit(<5crp) 5 to a set of 
groupoid word problems. The unbounded fan-in AND gates in the AC reduction 
can be replaced by log depth bounded fan-in sub-circuits. Then the groupoid word 
problem oracle gates, of which no more than a constant number can appear on any 
path from circuit inputs to circuit output, can be expanded into SAC 1 sub-circuits, 
since groupoid word problems are context-free languages. There results a logspace- 
uniform SAC 1 circuit, proving membership in LOGCFL. 

LOGCFL C QcFObit is seen by appealing to the fixed G whose word problem 
is LOGCFL-complete under DLOG TIME reducibility |BLM9|. Since DLOGTIME 



was shown expressible in FObit by [ BIS90|1 , the inclusion follows. □ 



4.2 Capturing LOGCFL without BIT 

Theorem 4.2. There is a fixed groupoid G such that LOGCFL C QqFO. 

Proof. We first show how to express plus and times and their negations as 
FO + (Qcrp) formulas (i.e. formulas which have outside of the groupoidal quanti- 
fier only a first-order quantifier prefix and in particular no negation). 

Let us look at the predicate "a • b = a" Define L =d e f { w G (0, 1, jf)* | \w\q = 
\w\\ } and 

<j)(a, b, c) = de f Ql(x, y, z) [(z = min) A (x < a) A (y < b), (z = y = max) A (x < c)] . 

Given a word w of length n and assignments for a, b, c, the transformation [z = 
min Ax<aAy<b,z = y = max A x < c] yields a string of length n 3 over the 
alphabet (0, 1, #) which contains a • b many 0s, c many Is, and n 3 — ab — c many 
#s. Thus this image is in L if and only if a ■ b = c. 

Observe that L is deterministic context-free, therefore its complement is context- 
free and we conclude that we can also express a ■ b ^ c by a FO + (QcrpFO) formula 
(in fact even by a QcrpFO formula). 

In a similar way we can express a + b = c and a + b ^ c by FO + (Qcrp) formulas. 
All context-free languages involved in the definition of these predicates can be com- 
bined into one language Lq, which is context-free and co-context-free. Now integer 
addition and multiplication are enough to simulate the BIT predicate. Indeed it can 
be shown that exponentiation can be defined from addition and multiplication (see 
e. g. pP93| , p. 301] and [|Smo91| , p. 192]), and from this it is not so hard to define the 



BIT predicate, as pointed out by [ |Lin94 ] (cf., |mm9|). We conclude that there is a 



FO + (Qcrp) formula for the bit predicate. The only groupoid quantifiers needed in 
this definition are Ql quantifiers, and they are applied to quantifier- free formulas. 
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From Theorem 4.1 we know that LOGCFL = QcrpFObit- Thus every set in 
A G LOGCFL can be defined by a formula 



Q L x[$i,...,Q a ], (1) 

where each <3?j is a FObit formula. 

We will show how every such formula can be transformed into Q//FO-formula, 
for some fixed context-free language L' . 

Using the argument above we can replace each <3?j in (|l|) by a formula without 
bit, but using the Ql quantifier. This formula can then be transformed into the 
form 

3xiVx 2 3x 3 ■ ■ ■ \f /\<pi u i 2 , (2) 

where each of the 4>i lt i 2 is either a positive atomic formula or a formula of the form 
Ql X, where x is quantifier-free. 

Now we combine stepwise the inner quantifiers Ql (1 < j < m) in formula 
(0) with the first-order connectives Vi A an d the first-order quantifiers 3, V. We 
give the construction for the case of an existential quantifier. Consider the formula 
■^xQlJj^i, . . . , £fc-i] , where L\ C A* is context-free and co-context-free. Suppose 
A = (ai, . . . , dk), # A. Let y = (m, . . . ,yi). This formula is equivalent to 
Q L2 (x,z,yx,. . .,yi)[£o,£x, • • where 

L 2 = { w G (ai, . . . ,a fc ,#)* | w = wi# + w 2 # + ■ ■ ■ # + w n # + , Wi G Li for some i }, 

£o is the formula z > 1 and each 1 < i < k — 1, is the formula z = 1 A £j. 
The transformation / defined by [£qj£1j • • • >£fe-i] m aps a word to of length n to 
a word /(u?) of length n l+2 . f(w) consists of n blocks u\,...,u n of length n' +1 
each: f(w) = u\ ■ ■ ■ u n . Here u m corresponds to the assignment x = m. Each u m 
consists of n blocks of length n , one block for each value of z. These blocks are all 
in ^* for z > 1, and consist of a word over A for z = 1. This word is exactly the 
word to which w is mapped under the transformation [£i, ... , i], when x = m. 
Hence we see that f(w) G L 2 if there is some value m such that u m G This 
proves the correctness of the above construction. Certainly L2 is context-free, and 
since the complement of L\ is context-free, we see that the complement of L 2 is also 
context-free (the construction of appropriate PDAs is obvious). 

The combinations of a Ql with a universal quantifier, or with a first-order 
connective, are dealt with analogously. 

We thus replaced the sub-formulas $j in formula (|l|) above and obtained a for- 
mula of the form 

QLxfti,...,*.], (3) 

where each is of the form Ql^i, ipi is quantifier- free, and Lj is context-free and 
co-context-free. Let L C Aq, where Aq = (a%, . . . ,a s ), #,$0 Aq. Let B =d c f 
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(oi, • • • , a s , #, $). We now define a substitution h by 

h{ai) = $L X #B* 
h(a 2 ) = $L^#*L 2 #B* 



h(ai) = $L 1 #*---#*L i - 1 #*L i #B* 



h(a s ) = $L X #*L 2 #* •••#%,_!#* 

and let L' =d e f h(L). Our formula replacing ([!]) then is 

Q L , ![*[,..., *' s+1 ], (4) 

where we have to construct the formulas ^[ such that the following holds: Given a 
word w, suppose the transformation given by . . . , Vl/ S _i] produces for a certain 
assignment of the variables x the letter a £ A; more specifically: suppose that 
ipi produces Wi (for 1 < i < s — 1). Then . . . ,^' s+ i\ has to produce a word 
$u>i#*u>2#* • • • #*w s _i#*. Certainly this can be done with quantifier-free formulas. 

Thus we have shown that LOGCFL C Qq ip FO. Now define H to be Greibach's 
hardest context-free language. Any cfl L reduces to H via a homomorphism (see 



[ABB97, p. 137]. This homomorphism is e-free but not length-preserving. Applying 
a non-unary groupoidal quantifier to simple FO-formulas can realize this homomor- 
phism, provided that a new padding or neutral symbol be introduced, to act as a 
filler in any word. Thus we see that any Q^FO formula can be transformed into an 
equivalent Q pa d(H)FO formula. □ 

A corollary to this proof is the following remarkable result: 

Corollary 4.3. Greibach's hardest context-free language with a neutral symbol is 
complete for LOGCFL under quantifier-free projections without BIT. 



A noteworthy strengthening of Theorem 4.1 thus follows from Theorem 4.2: 



Corollary 4.4. Q G r P FO = FO(Q G r P ) = LOGCFL. 



4.3 Unary groupoidal quantifiers 

In the previous subsection, we have shown that the situation with non-unary 
groupoidal quantifiers is clearcut, since a single such quantifier, even without the 
BIT predicate, captures all of LOGCFL. Here we examine the case of unary quan- 
tifiers. In this case, the presence or absence of the BIT predicate is once again 
relevant. 
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4.3.1 Unary groupoidal quantifiers without BIT 
Theorem 4.5. <2£ n rp F0 = CFL. 



Proof. The direction from right to left follows from [BLM93]: Every context-free 



language reduces via a length-preserving homomorphism to a groupoid word prob- 
lem. We can even look at the letters in a given word as groupoid elements. This 
reduction can be expressed in FO. 



The direction from left to right is proved by appealing to Theorem 2.4 and 



observing that the context-free languages have the required closure properties. □ 

It follows immediately that nesting unary groupoidal quantifiers (in fact, merely 
taking the Boolean closure of Q'q^FO) adds expressiveness: 

Corollary 4.6. 

Q£P rp FO = CFL C BC + (Q£ n rp FO) = BC+(CFL) 
C BC(Q£ n rp FO) = BC(CFL) 
C FO(0gp). 

Proof. All inclusions from left to right are clear. The first separation follows from 
the fact that CFLs are not closed under intersection. The second separation follows 
from considering the non-context-free language Y, consisting of all words of the form 
ww, the complement of which is context-free. □ 



The inclusion CFL C QqJ. FO in Theorem [4.5| could have be proved alternatively 



by observing that the logic 3MFO capturing CFL (see | LST94 |) is closed under FO 
translations. We note in the same vein: 

Theorem 4.7. Q G r P MSO = CFL. 



Proof. In [ [LST94[ it is in fact proved that CFL = 3MMSO. This logic is closed 



under monadic second-order (MSO) transformations. Hence CFL C Q(j rp MSO C 
3MMSO C CFL. □ 



Can we refine Theorem 4.5 and find a universal finite groupoid G which cap- 



tures all the context-free languages as QJiPFO? Intuition from the world of monoids 



[ BIS90 , p. 303] suggests that the answer is no. Proving that this is indeed the case 



is the content of Theorem 4.£ below. We first make a definition and state a lemma. 

Let Dt be the context-free one-sided Dyck language over 2t symbols, i. e. Dt con- 
sists of the well-bracketed words over an alphabet of t distinct types of parentheses. 
Recall that a PDA is a nondeterministic automaton which reads its input from left 
to right and has access to a pushdown store with a fixed pushdown alphabet. We 
say that a PDA A is k- pushdown-limited, for k a positive integer, iff 

• the pushdown alphabet of A has size k, and 

• A pushes no more than k symbols on its stack between any two successive 
input head motions. 

Lemma 4.8. No k -pushdown-limited PDA accepts Dt when t > (k + l) k + 1. 
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Proof. Suppose to the contrary that a A;-pushdown- limited PDA A accepts Dt, where 
t = {k+\) k + \. A has a certain fixed number, s, of states. Consider A'b computation 
as it scans a length- ra prefix of its input. Since A is fc-pushdown-limited, no more 
than (k+l) kn different stack contents, hence no more than s-(k+l) kn configurations, 
are encountered. But A must be able to distinguish between each pair of length-n 
prefixes consisting of left parentheses alone, because for any two such prefixes v% 
and V2, there is a Dyck word v\w such that V2W is not a Dyck word. Now, it is easy 
to see that t n , the number of length-n words over an alphabet of t left parentheses, 
exceeds s ■ (k + l) kn when n is large. Hence A cannot accept Dt. □ 



Theorem 4.9. Any finite groupoid G verifies Qq 1 FO C CFL. 

Proof. Suppose to the contrary that G is a finite groupoid such that Q^FO = CFL. 
Then there is a FO-translation from each context-free language to a word problem 
for G. This means that a finite set of PDAs (one for each word problem W(-, G)) can 
take care of answering each "oracle question" resulting from such a FO-translation. 
By Theorem [T^, each FO-translation is computed by a single- valued NFA. Although 
the NFAs differ for different context-free languages (and this holds in particular when 
language alphabets differ), the NFAs do not bolster the "pushdown- limits" of the 
PDAs which answer all oracle questions. Hence if k is a fixed integer such that all 
word problems W(-,G) for G are accepted by a fc-pushdown-limited PDA, then for 
any positive integer t, Dt is accepted by a /c-limited-pushdown PDA. This contradicts 



Lemma 4.8 when t = (k + 1) +1. □ 



In the next subsection we will see that the BIT-predicate provably adds ex- 
pressive power to the logic Qq" p FO. Since it is known that BIT can be expressed 
either by plus and times | Lin94 | (cf., [|Imm98| ) or by the majority of pairs quantifier 



| BIS90] , the following two simple observations about the power of Qq" p FO are of 



particular interest. 
Theorem 4.10. The majority quantifier is definable in QqJ FO. 
Proof. Majority is a context-free language. □ 



Theorem 4.11. Addition is definable in Qq" p FO. 

Proof. Let i,j,k be positions in the input word. We want to express that i + 
j = k. We do this by using a quantifier for the context-free language L =<} e f 
{ 0* _1 al*60* _1 cl* i£N}. Given a word w € L, if symbol a is at position i and b 
is at position j, then c must be at position i + j. □ 



4.3.2 Unary groupoidal quantifiers with BIT 

What are Q^FObit and FObit^Grp) 7 lt would seem plausible that Q^pFO b it £ 
FO bit (Q£ n rp ) C LOGCFL, but we are unable to prove Q£ n rp FO bit C LOGCFL, much 
less FObit(Qcrp) £ LOGCFL. The next lemma indicates that proving the latter 
would prove TC° + LOGCFL, settling a major open question in complexity theory. 
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Lemma 4.12. TC° C FO bit (Qg 1 rp ). 

Proof. TC° is captured by first-order logic with bit and majority quantifiers 
PIS90| . □ 



Hence the BIT predicate is expressive and will be difficult to defeat. The next 
lemma is not surprising, but it documents the provable expressiveness of BIT. Recall 



that CFL = Q^ p FO (Theorem gj). 



Lemma 4.13. CFL C Q^ p FO bit . 

Proof. The language of all words whose length is a power of two is in FObit hence 
in the difference of the two classes. □ 

The remainder of this subsection is devoted to documenting a more complicated 
setting in which the BIT predicate provably adds expressiveness. We want to show 
that FO(QqL) C FObit (Q(>p), i- e. that even when unary groupoidal quantifiers can 
be nested arbitrarily, the BIT predicate adds strength. 

For this, we define, for strings u, w of equal length the operations u, u A w and 
hV w which denote the bitwise complementation of u, the bitwise AND of u and w 
and the bitwise OR of u and w. We say that a string w is (1, m) -bounded if it is in 
u* ■ ■ ■ u*, for some strings Uj with \u{\ < m, for every i. 

We are going to make use of the following Lemma. 

Lemma 4.14. Let u be an (I, m) -bounded 0-1-string and w an (I' ,m') -bounded 0-1- 
string, for some l,m,l',m' > 1, and \u\ = \w\. Then the following hold. 

(a) u is (I, m) -bounded. 

(b) u Aw and it V w are (5(1 + I'), mm') -bounded. 

Proof, (a) is trivial. We show (b) only for u A w, the argument for u V w being 
completely analogous. 

We show the statement by induction onl + l'. The induction starts with the case 
l = T = 1. 

In this case, u = u\ and w = w\, for some i,j, ui,wi, with \u\\ < m and 
\w\ \ < m' . 

Let uiowi denote the string Aw[ Ul ' of length |-ui||wi| < mm'. Further let d 
and r be chosen such that |n| = d|ni||wi| +r and r < mm' . Then uAw = (ui<>wi) d v 
for some v with \v\ = r. Hence u A w is (2, mm')-bounded. 

Now let I + V > 2. W.l.o.g. we can assume that u = u^u^u' and w = w\w' 
where |iii|, \u2\ < m, \w±\ < m', u' is (I — 2, m)-bounded, w' is (I' — 1, m')-bounded 
and > (n^ 1 1. 

Let < r < m' be such that + r is a multiple of \wi\. Let u^~ r be the word 
U2 rotated r positions to the left. It should be clear that, from position \u 1 ^ \ + r in 
u Aw onwards, the word («^~ r o w±) is repeated, as long as the u 2 2 portion of u and 
the w\ portion of w keep "overlapping" . We distinguish two cases. 

Case 1: \w\\ < \u^\ + (u^l, i.e. the "overlap" with u 2 2 runs out within w\. 
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There are i 2 , 



< 


u 2 2 


u' 


1 

w\ 


w' 


i 2 , 1x3,1x4 with \us |, \ui\ < m, such that 


< 




i" 
U4U2 


u> 


3 

Vfy 


w' 



It is not hard to see that we can write (u l ^u 2 uz) A w{ as 

(u\ O Wi) kl ViV2(u2~ T O Wi) k2 V3, 

for some v 2 of length r, some k±,k2, and some ui,i»3 of length at most mm'. As 
U/\u 2 v! is (Z, m)-bounded and u/ is (I' — 1, m')-bounded it follows by induction that 
U4U l 2 u' A w' is (5(1 + I' — 1), mm') -bounded. Altogether, u A w is (5(/ + l'),mm')- 
bounded, as required. 

Case 2: |u^| > |tt^| + \u 2 2 \, i.e. u^ 2 runs out first. 





u 2 2 


v! 




w' 



Hence, there are and ^2,^3 with \w2\, \ws\ < m' such that 



u = 




ui 2 


v! 


w = 


j 1 


W 2 


j" 
W3W1 


w' 



Now, u^u % 2 A (w{ w 2 ) can be written as 

(ui o wifHiv^u^ o wi) fc2 w 3 , 

where \v2\ = r and \vi\, \vs\ < mm' , hence this string is (5, mm')-bounded. Again, by 
induction, it follows that the remaining part of u A w is (5(1 + 1' — 1), mm')-bounded, 
which implies the statement of the lemma. □ 

Let S be a fixed alphabet, and let a denote the corresponding signature. Let 
if be a FO(+)-<r-formula with free variables x and y = yi,... ,yu- For every string 
w € £*, we write for the 0-1 string u = v±, . . . v\ w \ with vi = 1 iff i, y) |= 99. 

Lemma 4.15. Lei £ = {0} and 00 = {.Po}- Let ip be a FO(+) -a^-formula with free 
parameters x and y = yi, . . . , y^. Then there are I and m such that for every n and 
yi, ■ ■ ■ ,yk it holds that t%,(0 n ) is (I, m) -bounded. 

Proof. Let tp' be the FO(+)-0-formula which results from ip by replacing every sub- 
formula Po(t) by true, introducing a new free variable, n, and restricting all quanti- 
fiers relative to n. I. e., sub-formulas 3z9 are replaced by 3z(z < n) A 9 and \/z6 is 
replaced by \/z(z < n) — > 6. Then we get 

(0 n ,x,y) ^<p (N,n,x,y) \=tf/, 
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where N denotes the natural numbers. Using Presburger Quantifier Elimination 



(see [BJ89, pp. 220ff] or [ 5mo87 , pp. 320ff]) we can transform ip' into an equivalent 
quantifier-free formula ip which may additionally use the constants and 1 and 
binary predicates • = • (mod c), for some constants c. The atomic formulas of ip are 
of one of the following forms. 

• ax + bn + aiyi -\ a k y k = c, 

• ax + bn + aiyi + • • • a k y k < c, 

• ax + bn + aiyi -\ a k y k > c, 

• ax + bn + a±yi + • • • a k y k = c (mod d), 

for some constants a, b, c, d, ai. For every fixed n,yi,... ,y k , the first formula defines, 
via the above equivalence, a (3, l)-bounded string in 0*1*0*, the second and third 
formula define a (2, l)-bounded string in 1*0* and 0*1* respectively, and the last 
formula defines a (2, d)-bounded string in 0*(10 d ~ 1 )*. As ip is fixed, by inductively 



applying Lemma 4.14 we get constants I and to, such that, for every n,y, i^,(0 n ) is 



(I, m)-bounded. □ 
Theorem 4.16. FO hit (Q^ rp ) is not contained in FO(Q££ p ). 

Proof. We consider the language { O™ 2 | n 6 N } , which is even expressible in FObit 
and show that it is not in FO(QqJ. p ). 

In order to do so, we show that, for every unary language L in FO(QqJ p ), the 
set {i | l € L} is semi-linear (i. e. the finite union of some arithmetic progressions). 

It is enough to show that, over a one-letter alphabet, every formula of the kind 
Qsx<p with CFL B and first-order ip (with addition) can be replaced by a first-order 
formula with addition. 

Hence, let ip = Qbx<P, for some first-order (p (with addition) and CFL B. 

Let, besides x,y = y%,... , y k be the free variables of ip. 

By Lemma 4.15, there exist / and m such that, for every n and y, t^(0 n ) is (I, m)- 



bounded. Let u\, . . . , u p be an enumeration of all 0-1 strings of length at most m. 
Let L' denote the (regular) language, defined by («*••• u*) 1 . It follows that t^(0 n ) 

is in L', hence it can be written as n^ 11 • • • Up lp u^ 21 • • • u p 2p ■ ■ ■ u p lp (where, for each 
j = 1, . . . , I, all but one of the . . . , ij p are 0). For a word w G L' we write I(w) 
for the set of tuples (in, . . . , ii p ) with ■ ■ ■ = w. We show in the following 
that Ib '■= UwgBnz/ H w ) * s a semi-linear set. 

Let T = an,... ,a>ip,-- - ,&i P be a new (Zp-letter) alphabet and let h be the 
homomorphism defined by h(aij) = U{. Let r denote the Parikh mapping for strings 
a ii ' ' ' a tp- Then we have 



I B = T(h- 1 (BnL')na* n ---a i 



which is semi-linear by Parikh's theorem [ |Har7£ , Sect. 6.9] 



Hence, ip is equivalent to a FO(+) formula ||Har78| , p. 231]. By induction, 
we get that every FO(+)((5qJ. p ), hence also every FO(QqJ. p ) formula, over a one- 
letter alphabet is equivalent to a FO(+) formula. Hence { 0™ 2 | n € N } is not in 
FO(Qu? rp ). ' □ 
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It is interesting to see that the proof makes use of quantifier elimination twice, 
first to get the bounded strings, and second to show that { 0™ | n € N } is not in 
FO(+). 

As a particular case we can now solve an open question of [BIS90|, addressing 
the power of different arity for majority quantifiers. 

Corollary 4.17. Majority of pairs can not be expressed in first-order logic with 
unary majority quantifiers. 



Proof. In Theorem [4.10| it was observed that the unary majority quantifier can be 
simulated in FO(Q G " p )- On the other hand in [BIS90] it is shown that majority of 
pairs is sufficient to simulate the BIT predicate. But as FOyt (QcSp) is n °t contained 
in FO(<5 G " p ) the BIT predicate and hence the majority of pairs is not definable in 
FO(Qq 1 vp ), hence it cannot be simulated by unary majority quantifiers. □ 



In the same way, this time relying on Theorem |4.1l| , we obtain: 
Corollary 4.18. Multiplication is not definable in FO(QqJ. p ). 



5 Conclusion 

Fig. H depicts the first-order groupoidal-quantifier-based classes studied in this pa- 
per. Together with the new characterization of FO-translations by means of aperi- 
odic finite transducers, the relationships shown on Fig. |T] summarize our contribu- 
tion. 

LOGCFL = FO bit (Q Grp ) = Q Grp FO 
A 



FO bit (Q G n rp ) 
A' 



AC 




CFL = Q G n rp FO 

t 

Q£ n FO 



Figure 1: The new landscape. Here G stands for any fixed groupoid, and a thick 
line indicates strict inclusion. 
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A number of open questions are apparent from Figure |j. Clearly, it would be nice 
to separate the FObit-based classes, in particular FObit(Q(>p) f rom FObit(Qcrp)) but 
this is a daunting task. A sensible approach then is to begin with QgLFObit- How 
does this compare with TC° for example? Can we at least separate QqJ. FObit from 
LOGCFL? We know that Q£ n rp FO bit % FO(Q£ n rp ); a witness for this is the set 

{ O™ 2 | n € N } , cf. the proof of Theorem |4.16| , 

Other natural questions prompted by our separation results concern extensions 
and refinements to Figure [l|. For example, in the world with BIT, which spe- 
cific groupoids G are powerful enough to express LOGCFL, and which are not? 
In the world without BIT, given the aperiodic transducer characterization of FO- 



translations, can we prove REG \ (REG n Q^fYO) ^ as easily as Lemma 4.8 
implies CFL \ Q™FO 7^ 0? More importantly, can we hope for an algebraic theory 
of groupoids to explain the detailed structure of CFL, much in the way that an 
elaborate theory of monoids is used in the extensive first-order parameterization of 
REG? 

But perhaps the most fundamental (and hopefully tractable) question arising 
from our work is not apparent from Figure |l]. It concerns the Boolean closure of 
the context-free languages. We have trivially used BC(CFL) (in fact BC + (CFL) 
sufficed) to witness the separation between QJ^. p FO and FO(Qq° p ). But what is 
BC(CFL) exactly, and what techniques are available to prove that a language is not 
in BC(CFL)? It is easy to prove that any non-regular language over a unary alphabet 
does not belong to BC(CFL), and a natural infinite hierarchy within BC + (CFL) is 



known [LW72], but the full question seems to have fallen into the cracks. We have 
several good candidates for membership in FO(QqJ. ) \ BC(CFL), but so far have 
been unable to prove these two classes different. 

Finally, ever since the regular languages in AC and in ACC° were characterized 



(the latter modulo a natural conjecture [ BCST9^ ]), one might have wondered about 



a similar characterization for the context-free languages in these classes, and in 
NC 1 . A unified treatment of LOGCFL subclasses under the banner of first-order 
logic might constitute a useful step towards being able to answer these questions. 
Since circuit-based complexity classes are closed under Boolean operations however, 
a better understanding of the interaction between the complement operation and 
groupoidal quantifiers is required. This once again seems to highlight the importance 
of understanding BC(CFL). 
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